Zhixuan Liang

Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments

May 28, 2026
Via arXiv

HiVLA: A Visual-Grounded-Centric Hierarchical Embodied Manipulation System

Apr 15, 2026
Via arXiv

From Passive Observer to Active Critic: Reinforcement Learning Elicits Process Reasoning for Robotic Manipulation

Mar 16, 2026
Via arXiv

R3DP: Real-Time 3D-Aware Policy for Embodied Manipulation

Mar 15, 2026
Via arXiv

UltraDexGrasp: Learning Universal Dexterous Grasping for Bimanual Robots with Synthetic Data

Mar 05, 2026
Via arXiv

VPWEM: Non-Markovian Visuomotor Policy with Working and Episodic Memory

Mar 05, 2026
Via arXiv

BiManiBench: A Hierarchical Benchmark for Evaluating Bimanual Coordination of Multimodal Large Language Models

Feb 09, 2026
Via arXiv

Performance-guided Reinforced Active Learning for Object Detection

Jan 22, 2026
Via arXiv

Expertise need not monopolize: Action-Specialized Mixture of Experts for Vision-Language-Action Learning

Oct 16, 2025
Via arXiv

Discrete Diffusion VLA: Bringing Discrete Diffusion to Action Decoding in Vision-Language-Action Policies

Aug 27, 2025
Via arXiv